Disambiguation of the Neuter Pronoun and Its Effect on Pronominal Coreference Resolution

نویسندگان

  • Véronique Hoste
  • Iris Hendrickx
  • Walter Daelemans
چکیده

Coreference resolution, determining the appropriate discourse referent for an anaphoric expression, is an essential but difficult task in natural language processing. It has been observed that an important source of errors in machine-learning based approaches to this task, is the wrong disambiguation of the third person singular neuter pronoun as either referential or non-referential. In this paper, we investigate whether a machine learning based approach can be successfully applied to the disambiguation of the neuter pronoun in Dutch and show a modest potential effect of this disambiguation on the results of a machine learning based coreference resolution system for Dutch.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Referential Versus Non-referential Use of the Neuter Pronoun in Dutch and English

This paper discusses a corpus-based investigation of the distribution of the thirdperson neuter singular pronoun in Dutch (“het”). We labeled all pronominal occurrences of “het” in a large corpus of documents. On the basis of the annotated corpora, we developed an automatic classification system using machine learning techniques to distinguish between the different uses of the neuter pronoun. A...

متن کامل

Stress, pauses, pronominal types and pronominal functions in Danish spoken data

In this paper we present a study of the relation between types of third personal singular neuter pronoun and their functions in Danish spoken data where stress information is marked so that personal and demonstrative occurrences of the pronouns can be distinguished. This study confirms that there are language specific differences in the way various types of pronoun are used to refer to abstract...

متن کامل

Developing Guidelines for the Annotation of Anaphors in the Chinese Treebank

This paper describes the CTB Coreference Annotation Guidelines for annotating pronominal anaphoric expressions in the Penn Chinese Treebank. The goals of the annotation are: to provide training data for learning-based pronoun resolution tools, and to provide a \gold" standard to be used in the evaluation of pronoun resolution algorithms. The choices that were made concerning the coindexing of p...

متن کامل

The Early Modern Genitive Its and Factors Involved in Genitive Variation

This article explores the variation between the emergent genitive its and the periphrastic form of it in Early Modern English, situating this case in the larger picture of English genitive variation. As previous studies have often focused on non-pronominal possessors (given that Present Day English pronominal possessors often appear prenominally, with limited variation), this early pronominal g...

متن کامل

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007